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Abstract 

We present a new approach to constructing unconditional pseudorandom generators 
against classes of functions that involve computing a linear function of the inputs. 
We give an explicit construction of a pseudorandom generator that fools the discrete 
Fourier transforms of linear functions with seed-length that is nearly logarithmic (up 
to polyloglog factors) in the input size and the desired error parameter. Our result 
gives a single pseudorandom generator that fools several important classes of tests 
computable in logspace that have been considered in the literature, including halfspaces 
(over general domains), modular tests and combinatorial shapes. For all these classes, 
our generator is the first that achieves near logarithmic seed-length in both the input 
length and the error parameter. Getting such a seed-length is a natural challenge in 
its own right, which needs to be overcome in order to derandomize RL - a central 
question in complexity theory. 

Our construction combines ideas from a large body of prior work, ranging from 
a classical construction of [NN93] to the recent gradually increasing independence 
paradigm of [KMN11, CRSW13, GMR + 12], while also introducing some novel ana¬ 
lytic machinery which might find other applications. 
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1 Introduction 


A central goal of computational complexity is to understand the power that randomness 
adds to efficient computation. The main questions in this area are whether BPP = P 
and RL = L, which respectively assert that randomness can be eliminated from efficient 
computation, at the price of a polynomial slowdown in time, and a constant blowup in 
space. It is known that proving BPP = P will imply strong circuit lower bounds that seem 
out of reach of current techniques. In contrast, proving RL = L, could well be within reach. 
Indeed, bounded-space algorithms are a natural computational model for which we know 
how to construct strong pseudo-random generators , PRGs, unconditionally. 

Let RL denote the class of randomized algorithms with O(logn) work space which can 
access the random bits in a read-once pre-specified order. Nisan [Nis92] devised a PRG of 
seed length 0(log 2 (n/e)) that fools RL with error e. This generator was subsequently used 
by Nisan [Nis94] to show that RL C SC and by Saks and Zhou [SZ99] to prove that RL can 
be simulated in space 0(log 3//2 n). Constructing PRGs with the optimal 0(log(n/e)) seed 
length for this class and showing that RL = L is arguably the outstanding open problem in 
derandomization (which might not require a breakthrough in lower bounds). Despite much 
progress in this area [INW94, NZ96, RR99, Rei08, RTV06, BRRY14, BV10, KNP11, Dell, 
GMR + 12], there are few cases where we can improve on Nisan’s twenty year old bound of 
0(log 2 (n/e)) [Nis92], 

1.1 Fourier shapes 

A conceptual contribution of this work is to propose a class of functions in RL which we call 
Fourier shapes that unify and generalize the problem of fooling many natural classes of test 
functions that are computable in logspace and involve computing linear combinations of 
(functions of) the input variables. In the following, let Ci = {z : \z\ < 1} be the unit-disk 
in the complex plane. 

Definition 1. A (m, n)-Fourier shape / : [ m] n —>• Ci is a function of the form /(aq,..., x n ) = 
n/=i fji x j ) where each fj : [m] —>• Ci. We refer to m and n as the alphabet size and the 
dimension of the Fourier shape respectively. 

Clearly, (m, n)-Fourier shapes can be computed with O(logn) workspace, as long as 
the bit-complexity of log(/)) is logarithmic for each j; a condition that can be enforced 
without loss of generality. Since our goal is to fool functions / : {0, l} n —>• {0,1}, it might be 
unclear why we should consider complex-valued functions (or larger domains). The answer 
comes from the discrete Fourier transform which maps integer random variables to Ci. 
Concretely consider a Boolean function / : {0, l} n —>• {0,1} of the form f(x) = w j x j) 
where x € {0, l} n , Wj G Z, and g : Z —>• {0,1} is a simple function like a threshold or a mod 
function. To fool such a function /, it suffices to fool the linear function w(x) = 'ffj WjXj. 

A natural way to establish the closeness of distributions on the integers is via the discrete 
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Fourier transform. The discrete Fourier transform of w(x) at a € [0,1] is given by 

n 

<f a {w(x)) = exp(27ria • w(x)) = JJ exp(2niawjXj) 

3 =1 


which is a Fourier shape. 

Allowing a non-binary alphabet m not only allows us to capture more general classes 
of functions (such as combinatorial shapes), it makes the class more robust. For instance, 
given a Fourier shape / : {0, l} n —>• C, if we consider inputs bits in blocks of length 
log(m), then the resulting function is still a Fourier shape over a larger input domain [m] 
(in dimension n/log(m)). This allows certain compositions of PRGs and simplifies our 
construction even for the case m = 2. 


1.1.1 PRGs for Fourier shapes and their applications. 

A PRG is a function Q : {0, l} r —>• [m] n . We refer to r as the seed-length of the generator. 
We say Q is explicit if the output of Q can be computed in time poly(n). 1 

Definition 2. A PRG Q : {0, l} r —» [ m] n fools a class of functions T = {/ : [m] n —»• C} 
with error e (or s-fools T) if for every f £ T, 


E [/(*)]- E {f(G(y))} 

x6uH" J/G u {0,l} r 


< £. 


We motivate the problem of constructing PRGs for Fourier shapes by discussing how 
they capture a variety of well-studied classes like halfspaces (over general domains), com¬ 
binatorial rectangles, modular tests and combinatorial shapes. 


PRGs for halfspaces. Halfspaces are functions h : {0, l} n —>• {0,1} that can be repre¬ 
sented as 

h(x) = l + ((w;,a:) — 6) 

for some weight vector w € and threshold 0 € Z where l + (a) = 1 if a > 0 and 
0 otherwise. Halfspaces are of central importance in computational complexity, learn¬ 
ing theory and social choice. Lower bounds for halfspaces are trivial, whereas the prob¬ 
lem of proving lower bounds against depth-2 TCo or halfspaces of halfspaces is a frontier 
open problem in computational complexity. The problem of constructing explicit PRGs 
that can fool halfspaces is a natural challenge that has seen a lot of exciting progress 
recently [DGJ + 09, MZ13, Kanllb, Kanl4, KM15]. The best known PRG construction 
for halfspaces is that of Meka and Zuckerman [MZ13] who gave a PRG with seed-length 
0(logn + log 2 (l/e)), which is 0(log 2 (n)) for polynomially small error. They also showed 

throughout, for a multi-set S, x £ u S denotes a uniformly random element of S. 
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that PRGs against RL with inverse polynomial error can be used to fool halfspaces, and thus 
constructing better PRGs for halfspaces is a necessary step towards progress for bounded- 
space algorithms. However, even for special cases of halfspaces (such as derandomizing the 
Chernoff bound), beating seed-length 0(log 2 (n)) has proved difficult. 

We show that a PRG for (2, n)-Fourier shapes with error e/v? also fools halfspaces with 
error s. In particular, PRGs fooling Fourier shapes with polynomially small error also fool 
halfspaces with small error. 


PRGs for generalized halfspaces. PRGs for (m, n)-Fourier shapes give us PRGs for 
halfspaces not just for the uniform distribution over the hypercube, but for a large class 
of distributions that have been studied in the literature. We can derive these results in a 
unified manner by considering the class of generalized halfspaces. 


Definition 3. A generalized halfspace over [m] n is a function g : [m] n —>• {0,1} that can 
be represented as 


g{x) = 1 + 


^Z 9 j (xj)-e 


\j =i 


where gj : [m] —>• R are arbitrary functions for j € [n] and 0£l. 


PRGs for (m, n)-Fourier shapes imply PRGs for generalized halfspaces. This in turn 
captures settings of fooling halfspaces with respect to the Gaussian distribution and the 
uniform distribution on the sphere [KRS12, MZ13, Kanl4, KM15], and a large class of 
product distributions over R n [GOWZIO]. 


Derandomizing the Chernoff-Hoeffding bound. A consequence of fooling general¬ 
ized halfspaces is to derandomize Chernoff-Hoeffding type bounds for sums of independent 
random variables which are ubiquitous in the analysis of randomized algorithms. We state 
our result in the language of “randomness-efficient samplers” (cf. [Zuc97]). Let X \,.... X n 
be independent random variables over a domain [rn\ and let gi,... ,g n '■ [m] —>• [—1,1] be 
arbitrary bounded functions. The classical Chernoff-Hoeffding bounds [Hoe63] say that 


Pr 


2=1 2=1 


> t 


< 2exp(—f 2 /4n). 


There has been a long line of work on showing sharp tail bounds for pseudorandom se¬ 
quences starting from [SSS95] who showed that similar tail bounds hold under limited 
independence. But all previous constructions for the polynomial small error regime re¬ 
quired seed-length 0(log 2 (n)). PRGs for generalized halfspaces give Chernoff-Hoeffding 
tail bounds with polynomially small error, with seed-length 0(log(?r)). 
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PRGs for modular tests. An important class of functions in L is that of modular tests , 
i.e., functions of the form g : {0, l} n —>• {0,1}, where g(x) = 1(JT mod m € S), for 
m < M, coefficients a* € Z m and S C Z m . Such a test is computable in L as long as 
M < poly(n). The case when m = 2 corresponds to small-bias spaces, for which optimal 
constructions were first given in the seminal work of Naor and Naor [NN93]. The case 
of arbitrary m was considered by [LRTV09] (see also [MZ09]), their generator gives seed- 
length Oi\og{n/e) + log 2 (M)). Thus for M = poly(ra), their generator does not improve 
on Nisan’s generator even for constant error e. PRGs fooling (2, n)-Fourier shapes with 
polynomially small error fools modular tests. 

PRGs for combinatorial shapes. Combinatorial shapes were introduced in the work 
of [GMRZ13] as a generalization of combinatorial rectangles and to address fooling linear 
sums in statistical distance. These are functions / : [m] n —» {0,1} of the form 

f(x) = h 

V»=i 

for functions g.; : [m] —>• {0,1} and a function h : {0, ...,n} —>• {0,1}. The best previ¬ 
ous generators of [GMRZ13] and [Del4] for combinatorial shapes achieve a seed-length of 
0(log(mn)+log 2 (l/e:)), 0(log m + log(n/£) 3 / 2 ); in particular, the best previous seed-length 
for polynomially small error was 0(log 3//2 (?r)). PRGs for (m, n)-Fourier shapes with error 
e/n imply PRGs for combinatorial shapes. 

Combinatorial rectangles are a well-studied subset of combinatorial shapes [EGL + 98, 
ASWZ96, LLSZ97, Lu02], They are functions that can be written as f(x) = }}, 1 ('Xj € Aj) 
for some arbitrary subsets Aj C [m]. The best known PRG due to [GMR + 12, GY14] gives 
a seed-length of 0(\og(jnn/e) log log(mn/e)). Combinatorial rectangles are special cases of 
Fourier shapes so our PRG for (m, n)-Fourier shapes also fools combinatorial rectangles, 
but requires a slightly longer seed. The alphabet-reduction step in our construction is 
inspired by the generator of [GMR + 12, GY14], 

1.1.2 Achieving optimal error dependence via Fourier shapes. 

We note that having generators for Fourier shapes with seed-length 0(log(?r)) even when 
e is polynomially small is essential in our reductions: we sometimes need error e/poly(n) 
for Fourier shapes in order to get e error for our target class of functions. Once we have 
this, starting with e a sufficiently small polynomial results in polynomially small error for 
the target class of functions. 

We briefly explain why previous techniques based on limit theorems were unable to 
achieve polynomially small error with optimal seed-length, by considering the setting of 
halfspaces under the uniform distribution on {0, l} n . Fooling halfspaces is equivalent to 
fooling all linear functions L(x) = w i x i i n Kolmogorov or cdf distance. Previous work 
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on fooling halfspaces [DGJ + 09, MZ13] relies on the Berry-Esseen theorem, a quantiative 
form of the central limit theorem, to show that the cdf of regular linear functions is close 
to that of the Gaussian distribution, both under the uniform distribution and under the 
pseudorandom distribution. However, even for the majority function (which is the most 
regular linear function), the discreteness of Y]i x i means that the Kolmogorov distance from 
the Gaussian distribution is 1 /y/n, even when x is uniformly random. Approaches that 
show closeness in cdf distance by comparison to the Gaussian distribution seem unlikely 
to give polynomially small error with optimal seed-length. 

We depart from the derandomized limit theorem approach taken by several previous 
works [DGJ + 09, DKN10, GOWZIO, HKM12, GMRZ13, MZ13] and work directly with the 
Fourier transform. A crucial insight (that is formalized in Lemma 9.2) is that fooling the 
Fourier transform of linear forms to within polynomially small error implies polynomially 
small Kolmogorov distance. 

1.2 Our results 

Our main result is the following: 

Theorem 1.1. There is an explicit generator Q : {0,l} r —>• [m] n that fools all ( m,n )- 
Fourier shapes with error e, and has seed-length r = 0(log(mn/e) • (loglog(mn/e)) 2 ). 

We now state various corollaries of our main result starting with fooling halfspaces. 

Corollary 1.2. There is an explicit generator Q : {0, l} r —>• {0, l} n that fools halfs¬ 
paces over {0, l} n under the uniform distribution with error e, and has seed-length r = 
0(log(n/e)(log log(n/e)) 2 ). 

The best previous generator due to [MZ13] had a seed-length of 0(logn + log 2 (l/e)), 
which is 0(log 2 n) for polynomially small error e. 

We also get a PRG with similar parameters for generalized halfspaces. 

Corollary 1.3. There is an explicit generator Q : {0, l} r —>• [m] n that e-fools generalized 
halfspaces over [m} n , and has seed-length r = 0(\og(mn/e) ■ (loglog (mn/e)) 2 ). 

From this we can derive PRGs with seed-length 0(log(n/e)(loglog(n/e)) 2 ) for fooling 
halfspaces with error e under the Gaussian distribution and the uniform distribution on 
the sphere. Indeed, we get the following bound for arbitrary product distributions over 
M n , which depends on the 4 th moment of each co-ordinate. 

Corollary 1.4. Let X be a product distribution on R n such that for all i € [n\, 

E[W] = 0,E[X 2 ] = l,E[Xf] < C. 

There exists an explicit generator Q : {0, l} r —>• M n such that ifY = Q{z), then for every 
halfspace h : M n —> {0,1}, 

|E[/i(X)] — E[h(K)]| < e. 


5 


The generator G has seed-length r = 0(\og(nC / e)(\og\og(nC / e)) 2 ). 

This improves on the result of [GOWZIO] who obtained seedlength 0(\og{nC/e) log (C/e) 
for this setting via a suitable modification of the generator from [MZ13]. 

The next corollary is a near-optimal derandomization of the Chernoff-Hoeffding bounds. 
To get a similar guarantee, the best known seed-length that follows from previous work 
[SSS95, MZ13, GOWZIO] was 0(log(mn) + log 2 (l/e)). 

Corollary 1.5. Let X\,, X n be independent random variables over the domain [m]. 
Let g\,...,g n : [m] —>• [—1,1] be arbitrary bounded functions. There exists an explicit 
generator G : {0, l} r —> [m] n such that if (Yi,..., Y n ) = G(z) where z G u {0, l} r , then Yi 
is distributed identically to Xi and 


Pr 


Y^9i(Yi)-Y^n9i(Yi)} 

i= 1 i =1 


> t 


< 2exp(— t 2 /2n) + e. 


G has seed-length r = 0(log(mn/e)(loglog(mn/e)) 2 ). 


We get the first generator for fooling modular tests whose dependence on the modulus 
M is near-logarithmic. The best previous generator from [LRTV09] had a seed-length of 
0(log(n/e) + log 2 (M)), which is 0(log 2 n) for M = poly(n). 

Corollary 1.6. There is an explicit generator Q : {0, l} r —>• {0, l} n that fools all linear 
tests modulo m for all m < M with error e, and has seed-length r = 0(log(Mn/e) ■ 
(log log (M n/e )) 2 ). 


Finally, we get a generator with near-logarithmic seedlength for fooling combinatorial 
shapes. [GMRZ13] gave a PRG for combinatorial shapes with a seed-length of 0(log(mn) + 
log 2 (l/e)). This was improved recently by De [Del4] who gave a PRG with seed-length 
0(log m + log(n/e) 3 / 2 ); in particular, the best previous seed-length for polynomially small 
error was 0((log(n) 3//2 ). 

Corollary 1.7. There is an explicit generator Q : {0, l} r —» [m] n that fools ( m,n )- 
combinatorial shapes to error e and has seed-length r = 0(log(mra/e)(log log(mn/e)) 2 ). 


1.3 Other related work 

Starting with the work of Diakonikolas et al. [DGJ + 09], there has been a lot of interest 
in constructing PRGs for halfspaces and related classes such as intersections of halfspaces 
and polynomial threshold functions over the domain {±l} n [DKN10, GOWZIO, HKM12, 
MZ13, Kanllb, Kanlla, Kanl4], Rabani and Shpilka [RS10] construct optimal hitting set 
generators for halfspaces over {±l} n ; hitting set generators are weaker than PRGs. 

Another line of work gives PRGs for halfspaces for the uniform distribution over the 
sphere (spherical caps ) or the Gaussian distribution. For spherical caps, Karnin, Rabani 
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and Shpilka [KRS12] gave a PRG with a seed-length of 0(log n + log 2 (l/e)). For the 
Gaussian distribution, [Kanl4] gave a PRG which achieves a seed-length of 0(logn + 
log 3//2 (l/e)). Recently, [KM15] gave the first PRGs for these settings with seedlength 
0((log(n/e))(log log(n/e))). Fooling halfspaces over the hypercube is known to be harder 
than the Gaussian setting or the uniform distribution on the sphere; hence our result gives 
a construction with similar parameters up to a O(loglogn) factor. At a high level, [KM15] 
also uses a iterative dimension reduction approach like in [KMN11, CRSW13, GMR + 12]; 
however, the final construction and its analysis are significantly different from ours. 

Gopalan et al. [GOWZIO] gave a generator fooling halfspaces under product distribu¬ 
tions with bounded fourth moments, whose seed-length is 0(log(n/e) log(l/e)). 

The present work completely subsumes a manuscript of the authors which essentially 
solved the special-case of derandomizing Chernoff bounds and a special class of halfspaces 
[GKM14], 


2 Proof overview 


We describe our PRG for Fourier shapes as in Theorem 1.1. The various corollaries are 
derived from this Theorem using properties of the discrete Fourier transform of integer¬ 
valued random variables. 

Let us first consider a very simple PRG: 0(l)-wise independent distributions over [m] n . 
At a glance, it appears to do very poorly as it is easy to express the parity of a subset of 
bits as a Fourier shape and parities are not fooled even by (n — l)-wise independence. The 
starting point for our construction is that bounded independence does fool a special but 
important class of Fourier shapes, namely those with polynomially small total variance. 
For a complex valued random variable Z, define the variance of Z as 

a 2 {Z) = E [\Z - E[Z]| 2 ] = E[|Z| 2 ] - | E[Z]| 2 . 

R is easy to verify that 

a 2 {Z) + \E[Z]\ 2 = E[\Z\\ 
so that if Z takes values in Ci, then 

a 2 (Z) + \E[Z]\ 2 < 1. 

The total-variance of a (?n, n)-Fourier shape / : [m] n —>• Ci with /(x) = \Yj = \ 
is defined as 

Tvar (/) = ^2(r 2 (fj(xj)). 

3 

To gain some intuition for why this is a natural quantity, note that Tvar(/) gives an easy 
upper bound on the expectation of a Fourier shape: 


E 

xG[m]' 


[/(*)] 


II ^ II \A ~ a2 (fj( x j)) < exp(-Tvar(/)/2). (1) 
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This inequality suggests a natural dichotomy for the task of fooling Fourier shapes. It 
suggests that high variance shapes where Tvar(/) 2> log(l/e) are easy in the sense that 
E[/] is small for such Fourier shapes. So a PRG for such shapes only needs to ensure 

that E[/] is also sufficiently small under the pseudorandom output. 

To complement the above, we show that if the total-variance Tvar(/) is very small, then 
generators based on limited independence do fairly well. Concretely, our main technical 
lemma says that limited independence fools products of bounded (complex-valued) random 
variables, provided that the sum of their variances is small. 

Lemma 2.1. Let Yf,... ,Y n be k-wise independent random variables taking values in Ci. 
Then, 

n / sr^n 

E[Yi Y n ] - J] E i Y j} < exp (0(k)) 

j =i v 

We defer discussion of the proof to Section 2.4, and continue the description of our PRG 
construction. Recall that we are trying to fool a (m, n)-Fourier shape / : [ m\ n —>• Ci with 
Tvar(/) < 0(\og{\/e) to error e = poly(l/nm). It is helpful to think of the desired error 
e as being fixed at the beginning and staying unchanged through our iterations, while m 
and n change during the iterations. Generating fc-wise independent distributions over [m] n 
takes 0(k\og(mn)) random bits. Thus if we use k = 0(log(l/e))-wise independence, we 
would achieve error e, but with seed-length 0( log(l/e) log(mn)) rather than 0(log(l/s)). 

On the other hand, if Tvar(/) < 1 /{mn) c for a fixed constant c, then choosing k = 
0(log(l/e)/(log mro))-wise independence is enough to get error e while also achieving seed- 
length Oik log(m?r)) = 0( log(l/e)) as desired. We exploit this observation by combining 
the use of limited independence with the recent iterative-dimension-reduction paradigm of 
[KMN11, CRSW13, GMR + 12], Our construction reduces the problem of fooling Fourier 
shapes with Tvar (/) < 0(log(l/e)) through a sequence of iterations to fooling Fourier 
shapes where the total variance is polynomially small in m,n in each iteration and then 
uses limited independence in each iteration. 

To conclude our high-level description, our generator consists of three modular parts. 
The first is a generator for Fourier shapes with high total variance: Tvar(/) > poly(log(l/e)). 
We then give two reductions to handle low variance Fourier shapes: an alphabet-reduction 
step reduces the alphabet m down to y/m and leaves n unchanged, and a dimension- 
reduction step that reduces the dimension from n to yfn while possibly blowing up the 
alphabet to poly(l/e). We describe each of these parts in more detail below. 

2.1 Fooling high-variance Fourier shapes 

We construct a PRG with seed-length 0(log(mn/e) log log(l/e)) which £-fools (m, n)-Fourier 
shapes / when Tvar(/) > (log^/e))^ for some sufficiently large constant C. We build the 
generator in two steps. 


Vk ) 





Iii the first step, we build a PRG with seed-length 0(\og(mn )) which achieves constant 
error for (m, n)-Fourier shapes / with Tvar(/) > 1. In the second step, we drive the 
error down to e as follows. We hash the coordinates into roughly (log(l/e))°^ buckets, 
so that for at least fl(log(l/e)) buckets, / restricted to the coordinates within the bucket 
has total-variance at least 1. We use the PRG with constant error within each bucket, 
while the seeds across buckets are recycled using a PRG for small-space algorithms. This 
construction is inspired by the construction of small-bias spaces due to Naor and Naor 
[NN93] ; the difference being that we use generators for space bounded algorithms for 
amplification, as opposed to expander random walks as done in [NN93]. 

2.2 Alphabet-reduction 

The next building block in our construction is alphabet-reduction which helps us assume 
without loss of generality that the alphabet-size m is polynomially bounded in terms of 
the dimension n. This is motivated by the construction of [GMR + 12]. 

Concretely, we show that constructing an e-PRG for (m, n)-Fourier shapes can be re¬ 
duced to that of constructing an e'-PRG for (n 4 , n)-Fourier shapes for s' ~ e/(logm). The 
alphabet-reduction step consists of (log log m) steps where in each step we reduce fooling 
(m, n)-Fourier shapes for m > n 4 , to that of fooling (y/rn, n)-Fourier shapes, at the cost 
of O (log (m/s)) random bits. 

We now describe a single step that reduces the alphabet from rri to y/rn. Consider the 
following procedure for generating a uniformly random element in [m] n : 

• For D ~ y/rri, sample uniformly random subsets 

5! = {X[l, 1],X[1, 2],... ,X[D, 1]}, ...,S n = {X[l,n],X[2, n],... , X[D, n}} C [m], 

• Sample Y = (li,..., Y n ) uniformly at random from [ D] n . 

• Output (Z i,..., Z n ), where Zj = X[Yj,j]. 

Our goal is to derandomize this procedure. The key observation is that once the subsets 
S\,... ,S n are chosen, we are left with a ( D ,n)-Fourier shape as a function of Y. So the 
choice of Y can be derandomized using a PRG for Fourier shapes with alphabet [D], and it 
suffices to derandomize the choice of the X’s. A calculation shows that (because the T’s 
are uniformly random), derandomizing the choice of the A’s reduces to that of fooling a 
Fourier shape of total-variance Lemma 2.1 implies that this can be done with 

limited independence. 

2.3 Dimension-reduction for low-variance Fourier shapes 

We show that constructing an e-PRG for (n 4 , n)-Fourier shapes / with Tvar(/) < poly(log(mn/e)) 
can be reduced to that of e'-fooling (poly(n/e), v / n)-Fourier shapes for s' « e/log n. Note 
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that here we decreased the dimension at the expense of increasing the alphabet-size. How¬ 
ever, this can be fixed by employing another iteration of alphabet-reduction. This is the 
reason why considering (m, n)-Fourier shapes for arbitrary m helps us even if we were only 
trying to fool (2, n)-Fourier shapes. The dimension-reduction proceeds as follows: 

1. We first hash the coordinates into roughly y/n buckets using a fc-wise independent 

hash function h T~i = {h : [n] [v^]} for k ~ 0(log(n/e)/logn). Note that this 

only requires 0(log(n/e)) random bits. 

2. For the coordinates within each bucket we use a k'- wise independent string in [m] n 
for k! « 0(log(n/e)/log n). We use true independence across buckets. Note that this 
requires y/n independent seeds of length r = 0(log(n/e)). 

While the above process requires too many random bits by itself, it is easy to analyze. We 
then reduce the seed-length by observing that if we fix the hash function h, then what we 
are left with as a function of the seeds used for generating the symbols in each bucket is a 
(2 r < poly(n/e), v / re)-Fourier shape. So rather than using independent seeds, we can use 
the output of a generator for such Fourier shapes. 

The analysis of the above construction again relies on Lemma 2.1. The intuition is 
that since Tvar(/) < poly(log(n/e)), and we are hashing into y/n buckets, for most hash 
functions h the Fourier shape restricted to each bucket has variance 0(l/n c ) for some fixed 
constant c > 0. By Lemma 2.1, limited independence fools such Fourier shapes. 

2.4 Main Technical Lemma 

The lemma can be seen as a generalization of a similar result proved for real-valued random 
variables in [GYM] (who also have an additional restriction on the means of the random 
variables Yj). However, the generalization to complex-valued variables is substantial and 
seems to require different proof techniques. 

We first consider the case where the Y/s not only have small total-variance, but also 
have small absolute deviation from their means. Concretely, let Yj = )Xj{ 1 + Z/) where 
E \Zj\ = 0 and \Zj\ < 1/2. In this case, we do a variable change Wj = log(l + Z/) (taking 
the principal branch of the algorithm) to rewrite 


n Y j = n /b (| + z .i) li/o ex p ( yi 

3 3 3 \ 3 

We then argue that exp(^T Wj) can be approximated by a polynomial P(W \,..., W n ) of 
degree less than k with small expected error. The polynomial P is obtained by truncating 
the Taylor series expansion of the exp( ) function. Once, we have such a low-degree polyno¬ 
mial approximator, the claim follows as limited independence fools low-degree polynomials. 
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To handle the general case where Zj's are not necessarily bounded, we use an inclusion- 
exclusion argument and exploit the fact that with high probability, not many of the Zj's 
(say more than k/ 2) will deviate too much from their expectation. We leave the details to 
the actual proof. 


3 Preliminaries 


We start with some notation: 

• For v € M n and a hash function h : [n] —> [to] , define 

m 

K v ) = H t >- 1 (j)ll2 ( 2 ) 

3 =1 

• Ci = {z : z G C, \z\ < 1} be the unit disk in the complex plane. 

• For a complex valued random variable Z, 

Var(Z) = a 2 (Z) = E [\Z - E[Z]| 2 ] . 


• Unless otherwise stated c, C denote universal constants. 

• Throughout we assume that n is sufficiently large and that 5, e > 0 are sufficiently 
small. 


• For positive functions f,g , h we write / = g + 0(h .) when | f — g\ = 0(h). 

• For a integer-valued random variable Z, its Fourier transform is given as follows: for 
a € [0,1], Z(a) = E[exp(27riaZ)]. Further, given the Fourier coefficients Z(a ), one 
can compute the probability density function of Z as follows: for any integer j, 


Pr [Z = j} 


exp(2irija)Z(a) da. 


Definition. For n, to, 6 > 0 we say that a family of hash functions H = {h : [n] —>• [to]} is 
5-biased if for any r < n distinct indices . .., i r G [n] and j i,... ,j r € [to], 

Pr [h(h) = ji A h(i 2 ) = j 2 A ■ ■ ■ A h(i r ) = j r \ = — ± 5. 
he u H to” 

We saj/ that such a family is k-wise independent if the above holds with (5 = 0 for all 
r < k. 

We say that a distribution over {±l} n is 5-biased or k-wise independent if the corre¬ 
sponding family of functions h : [n] —>• [2] is. 
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Such families of functions can be generated efficiently using small seeds. 

Fact 3.1. For n,m,k,5 > 0, there exist explicit 5-biased families of hash functions FL = 
{h : [n] —» [m]} that can be generated efficiently from a seed of length s = 0(log(n/5)). 
There are also, explicit k-wise independent families that can be generated efficiently from 
a seed of length s = 0(k log(ram)). 

Taking the pointwise sum of such generators modulo m gives a family of hash func¬ 
tions that is both 5-biased and k-wise independent generated from a seed of length s = 
0(log(n/<5) + k\og(nm)). 

3.1 Basic Results 

We start with the simple observation that to (5-fool an (m,?t)-Fourier shape/, we can 
assume the functions in / have bit-precision 21og 2 (n/<5). This observation will be useful 
when we use PRGs for small-space machines to fool Fourier shapes in certain parameter 
regimes. 

Lemma 3.2. If a PRG Q : {0, l} r —>• [m] n 5-fools (m,n)-Fourier shapes f = JX fj when 
log (fj)’s have bit precision 21og 2 (n/5), then Q fools all (m,n)-Fourier shapes with error 
at most 25. 

Proof. Consider an arbitrary (m, n)-Fourier shape / : [ m\ n —>• Ci with / = IX fj- Let 
fj : [m] — >• Ci be obtained by truncating the log(/,)’s to 21og 2 (n/<5) bits. Then, \fj(xj) — 
fj(xj) | < 6 /n for all Xj E [m]. Therefore, if we define / = Yljfji then for any x E [m] n , 
(as the fj ’s and ffs are in Ci) 


f{x) - f(x) 

= 

nxxx - YU( x j) 

< xxx - hxj) 



j 3 

3 


The claim now follows as the above inequality holds point-wise and by assumption, Q 
5-fools /. □ 

We collect some known results about pseudorandomness and prove some other technical 
results that will be used later. 

We shall use PRGs for small-space machines or read-once branching programs (ROBP) 
of Nisan [Nis92], [NZ96] and Impagliazzo, Nisan and Wigderson [INW94], We extend the 
usual definitions of read-once branching programs to compute complex-valued functions; 
the results of [Nis92], [NZ96], [INW94] apply to this extended model readily 2 . 

Definition 4 ((S, D,T)- ROBP). An (S, D,T)-ROBP M is a layered directed graph with 
T + 1 layers and 2 s vertices per layer with the following properties. 

2 This is because these results in fact give guarantees in terms of statistical distance. 
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• The first layer has a single start node and the vertices in the last layer are labeled by 
complex numbers from Ci. 

• A vertex v in layer i, 0 < i < T has 2 D edges to layer i + 1 each labeled with an 
element of {0,1} D . 

A graph M as above naturally defines a function M : ({0,1} D ) T —»• Ci where on in¬ 
put (zi,...,z T ) € ({0, 1} D ) T one traverses the edges of the graph according to the labels 
zi,..., zt and outputs the label of the final vertex reached. 

Theorem 3.3 ([Nis92], [INW94]). There exists an explicit PRG g lNn : {0, l} r —> ({0, 1} D ) T 
which e-fools (S, D,T)-branching programs and has seed-length r = 0(D + S log T + 
log(T/<5)-(logT)). 

Theorem 3.4 ([NZ96]). For all C > 1 and 0 < c < 1, there exists an explicit PRG 
G NZ : {0, l} r —»■ ({0, which e-fools (S, S, S c )-branching programs for e = 2~ logl c s 

and has seed-length r = O(S). 

The next two lemmas quantify load-balancing properties of d-biased hash functions in 
terms of the t'p-norms of vectors. Proofs can be found in Appendix A. 

Lemma 3.5. Let p > 2 be an integer. Let v € M n and FL = {h : [re] —» [rre]} be either a 
5-biased hash family for 5 > 0 or a p-wise independent family for <5 = 0. Then 

E[/i(u) p ] < 0(p) 2p + 0(p) 2p ||u||4 P + m p \\v\\2 P S. 


Lemma 3.6. For all v € MIL, let p > 2 be even and FL = {h : [re] —> [m]} a p-wise 
independent family, and j G [m], 


Pr 




> t 


< 


o { P ) p / 2 


tp 


4 Fooling products of low-variance random variables 


We now show one of our main technical claims that products of complex-valued random 
variables are fooled by limited independence if the sum of variances of the random variables 
is small. The lemma is essentially equivalent to saying that limited independence fools low- 
variance Fourier shapes. 


Lemma 4.1. Let ... ,Y n be k-wise independent random variables taking values in Ci. 
Then, 


E[i"l"-^n]-n E K-] 

3 =1 


< exp(OO)) • ( ^ j<7 7 — 


n{k) 
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More concretely, let X\ ,... ,X n be independent random variables taking values in Ci. 
Let a'f = Var(JQ) and Yi=i °f — ■ Let A: be a positive even integer and let Y \,... ,Y n 

be a Ck -wise independent family of random variables with each Yj distributed identically 
to Xi. Then, we will show that for C a sufficiently big constant, 

|E [Yi • • • T n ] — E [X\ ■ ■ ■ X n ] | = exp(0(A:)) • ( a/Vk) k . (3) 


We start with the following standard bound on moments of bounded random variables 
whose proof is deferred to appendix B. 

Lemma 4.2. Let Z\., Z n £ C be random variables with E[Zj] = 0, [|^i||oo < B and 
Yi Var(Zi ) < a 2 . Then, for all even positive integers k, 


E 




< 2 0{ - k \aVk + Bk) k . 


We also use some elementary properties of the (complex-valued) log and exponential 
functions: 


Lemma 4.3. 1. For z € C with \z\ < 1/2, | log(l + z)\ < 2\z\, where we take the 

principle branch of the logarithm. 

2. For w £ C and k > 0, 


fc-i 

exp(ie) — y] w k /k\ 
3=0 


< 0 ( 1 ) 



max(l, exp(3?(tc))). 


3. For a random variable Z £ C with \Z\oo < 1/2, E [Z\ = 0, and W = log(l + Z) 
the principle branch of the logarithm function (phase between Var(W) < 

4 Var(Z). 

4■ For any complex-valued random variable W £ C, |exp(E[W])| < E[|exp(W)|]. 

Proof. Claims (1), (2) follow from the Taylor series expansions for the complex-valued log 
and exponential functions. 

For (3), note that Var(W) < E[|W| 2 ] < 4E[|Z| 2 ] = 4 Var(Z). 

For (4), note that | exp(E[W])| = | exp(Fl[3?(W)])| and similarly | exp (W)\ = \ exp(3?(W))|. 
The statement now follows from Jensen’s inequality applied to the random variable 9i(W). 

□ 


We prove Lemma 4.1 or equivalently, Equation (3) by proving a sequence of increasingly 
stronger claims. We begin by proving that Equation (3) holds if Xj’s have small absolute 
deviation, i.e., he in a disk of small radius about a fixed point. 


14 









Lemma 4.4. Let X\ and Yj, be as above. Furthermore, assume that Yi = pt( 1 + Zj) /or 
complex numbers pi = E[Y)] and random variables Zi so that with probability 1, |Zj| < B < 
1/2 /or all i. Let of = Varf Z t ), and d 2 = Ya =i ■ ^ en we have that 

|E [Xl • • • A' n ] — E [Yi ■ ■ ■ y n ]| = exp(0(fc)) • (d/k 1 ' 2 + B) fe . 

Proof. Let Wj = log(l + Zj), taking the principle branch of the logarithm function and let 
W'j = Wj — E[TFj]. Then, by Lemma 4.3 (1), (3), | Wj\ < 2\Zj\ < 2B, so that \Wi\' < 4 B 
and Var(lFj) = 0(d 2 ). Finally, let W = Yl]=i Wj. 

Now, by Lemma 4.3 (3) 

n n 

n«-n {Pi exp(E[TF;])) exp(IF) 

2=1 1=1 

= n Of exp(E[Wi])) 7T + °( x ) ' (^rr) ■ max(l, exp(K(W))) 
i=i \e=o ' V • / 

Note that the expectation of the t ,th powers of W are fooled by the fc-wise independence 
of the F’s for £ < k. Therefore the difference in the expectations between the product of 
y’s and the product of X’s is at most 

n 

0(1) • IJ (Pi exp(E[Wj])) E 
2—1 

Now, by Lemma 4.3 (4), 


\W\ 

U" 


• max(l, exp(3?(W))) 


( 4 ) 



]~[pi exp(E[Wj]) 


2=1 


U Hi ■ exp [ E 


2=1 


Y w ‘ 


< 


fi*-E 


2=1 


exp I Wi 


= E 


•~[pi exp(Wj) 


2=1 


= E 


IT' 

2=1 


< 1. 


Further, 


'Q Pi exp(E[Wj]) 


2=1 


exp($ft(TF)) = 


j J Pi exp(E[Wj]) • exp (IF) 


2=1 


IT" 

2=1 


< 1. 


Therefore, by Lemma 4.2, the expression in (4) is at most 

k 


0(1) E 


\W\ k 

“fcT 


< 2°( fc ) • 


a 


y/k + Bk 


k 


= 2° (fc) • (d/k l/2 + B) k . 


□ 


15 


































Next, we relax the conditions to handle the case where we only require the means of 
the Xj's be far from zero. 

Lemma 4.5. Let Xj and Yi be as in Equation (3). Let fii = E[Xj]. If \fii\ > (a/Vk) 1 ^ 3 
for all i, then Equation (3) holds. 

Proof. We assume throughout that a/\fk is less than a sufficiently small constant; other¬ 
wise, there is nothing to prove. Further, note that there can be at most k different indices 
j £ [n] where aj > a/\fk. As even after conditioning on the values of the corresponding 
Y% the remaining Y/s are (C — l)/c-independent, it suffices to prove the lemma when 
aj < a/Vk for all j. 

To apply Lemma 4.4, we consider a truncation of our random variables: define 


Yi = 



if |Yi - ml < ( a/yfk ) 2/3 
else 


We claim that the variables Yi satisfy the conditions of Lemma 4.4. Let m = E[Y^]. 
Note that by Chebyshev bound, Pr(Yi / Y t ) < a 2 (a/Vk)~ 4: ^ 3 < ( a/y/k ) 2 / 3 . Therefore, 
I hi ~ hi\ < (a/Vk) 2/3 , so that \fii\ > (1/2)|/u.*|. Furthermore, letting Yj = /2j(l + Zf), we 
have that 


E [Zi] = 0, Halloo < 2 (a/Vk) 1/3 , Var(Zj) < 4 a 2 {a/Vk)~ 2 / 3 , 


Finally, note that 


^Var(^) <Aa 2 {a/y/k)- 2 ' 3 . 

i 

( 5 ) 


n*=n«-*+s)= e 

i— 1 i= 1 SC[n] i£S i^S 


We truncate the above expansion to only include terms corresponding to sets S with 
\S\ < m for m = 0{k) to be chosen later. Let 


p rn {Y U ...,Y n ) = y , nw-^n«), 

SC[ra],|S|<miGS i^S 

and let N equal the number of i so that Yi / Tj. We claim that 


'[[Y j -P m (Y 1 ,...,Y n ) 

3 =1 



The above clearly holds when N < m, since in this case for any S of size at least 
m we have — Yi) = 0. On the other hand for N > m we note that there 
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are at most (^) — 2 m {^ n ) subsets S for which this product is non-zero. Hence, 

\P m (Y 1 ,... ,Y n )\ < 2 m 0. 

We now argue that Ck- wise independence fools the individual terms of P m when m = 
O(k). This is because, the Yj for j E S are independent and conditioned on their values, the 
remaining Yj for j ^ S are still C'k- wise independent for some sufficiently large constant 
C. Therefore, applying Lemma 4.4 with parameters as given by Equation (5), Ck- wise 
independence fools P m {Y \,..., Y n ) up to error 


e n|«m-si|W f k (^y 2/3 + (^) 

SC[n],\S\<mieS \ V K VV k J \Y K J 


1/3' 


3fc 


where 

^ \fk V Vk J 

Therefore, P m is fooled to error 

m— 1 


\ - 2 /3 


\ 1/3' 


3 k 


+ (7iJ = 0 ^ 


X> 


e=o 


N 


2°^ . (a/Vkf. 


Note that the expectation above is the same as what it would be if the Y^’s were fully 
independent, in which case it is at most 


E[2*] = + Pr(y, / Yi)) < exp ( E P + E) 


i=l 


\i=l 


exp ( O ( ^a 2 (a/Vk) 


-4/3 = 


= exp(0(<r 2 / 3 fc 2 / 3 )) = exp(0(fc)). 


\i=l 


Therefore, Ck -wise independence fools P m to error 2°^ ■ (a/\fk) k . 
On the other hand, the expectation of is 


E Unto**)* 

SC[n],|5|=mi€S 


(Er=iP^^^)) ? 


ml 


< 


(Eli 


ml 


<0({a 2 /m)(a 2 /k)- 2 ^y . 

Taking m = 2>k/2 yields a final error of exp (O(k)) ■ (a/Vk) k . This completes our proof. □ 
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Finally, we can extend our proof to cover the general case. 

Proof of Lemma f.l. Note that it suffices to prove that Equation (3) holds. As before, it 
suffices to assume that a/Vk <C 1 and that a t < a/Vk for all i. 

Let m be the number of i so that | E[A^] | < ( a/y/k ) 1//3 . Assume that the Ws with small 
expectation are Yi,..., Y m . We break into cases based upon the size of m. 

On the one hand if m < 6k, we note that for C sufficiently large, the values of Y \,..., Y m 
are independent of each other, and even after conditioning on them, the remaining Y^’s are 
still C'k-wise independent. Thus, applying Lemma 4.5 to the expectation of the product 
of the remaining Y % we find that the difference between the expectation of the product of 
X's and product of Y’s is as desired. 

For m > 6k we note that 


E 


n v 


_i=1 


[]|E[Y:]| < (a/Vk) m / 3 . 


i=l 


Therefore, it suffices to show that 


E 


IT'. 

i=1 


= 0(a/Vk) h 


Notice that so long as at least 3k of Y\,... ,Y m have absolute value less than 2(a/Vk ) 1 / 3 , 
then 


n> 

2—1 


= 0{(j/Vk) k . 


Therefore, it suffices to show that this occurs except with probability at most 0(a/Vk) k . 
Let N be the number of 1 < i < m so that |Yj| > 2(a/y/k) 1 ^ 3 . Note that 


E [N] =£pr(|y i | > 2{a/Vk) 1/3 ) < <r?(<T/Vk)~ 2/3 < a 2 (a 2 /fe)" 1/3 . 


2=1 


2=1 
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On the other hand, we have that 


Pr(IV > 3k) < E 



e n pr d^i^w^) i/3 ) 

SC[m],\S\=3kieS 

_ (E£iPr(IVI>2^/Vr)l/3))” 

£ (3fe)! 

_ E[IV] 3fc 

my- 

< 0((a 2 /k) 2/3 ) 3k 


< 0(a/Vk) k . 


This completes the proof. 


□ 


5 A Generator for high-variance Fourier shapes 

In this section, we construct a generator that fools Fourier shapes with high variance. 

Theorem 5.1. There exists a constant C > 0, such that for all 5 > 0, there exists an 
explicit generator Qt : {0, l} re —>• [m] n with seed-length ri = 0(\og(mn/8) log log(l/J)) 
such that for all Fourier shapes f : [m] n —» Ci with Tvar (/) > CTog 5 (l/<5), we have 


E [/(&(*))]- E [f(X)j 

z~{0,l} e XG u [m] rl 


< 8 . 


We start with the simple but crucial observation that Fourier shapes with large variance 
have small expectation. 


Lemma 5.2. For any Fourier shape f : [m] n —>• Ci, we have 


E 

xe u [m] n 


lf(x)} 


< exp(— Tvar(/)/2). 


( 6 ) 


Proof. Let f(x) = [] ? Since fj(x) € Ci, we have \fj(x)\ < 1. Let Hj = [fj( x j)\- 

For X [m] n , 


a 2 = E[| fj{Xj) - ///] = E[|/j(Xj)| 2 ] - \hj\ 2 < 1 - \Fj\ 2 
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Hence 


n 


n 


|[/P0] 


nN ^ n<w) i/a 


J = 1 j=l 

n 

< exp(-^V|/2) < exp(—Tvar(/)/2). 
j=i 


□ 

We build the generator in two steps. We first build a generator with seed-length 0(log n) 
which achieves constant error for all / with Tvar(/) >1. In the second step, we reduce the 
error down to 5. This construction is inspired by a construction of Naor and Naor [NN93] 
of small-bias spaces. 


5.1 A generator with constant error 

Our goal in this subsection is get a generator with constant error for Fourier shapes where 
Tvar(/) = 0(1). We start by showing that when Tvar(/) = 0(1) (instead of just 11(1)), 
0(l)-wise independence is enough to fool /. 


Lemma 5.3. For all constants 0 < c\ < C 2 , there exist p £ Z + and 0 < c' < 1 such that 
the following holds. For any ( m,n)-Fourier shape, f with Tvar(/) £ [ci,C 2 ], and Z ~ [m] n 
2p-wise independent, 


I am 


< c. 


Proof. Let / = n, fji A £ u [ m] n . Now, by Lemma 4.1 applied to Yj = fj(Zf), we have, 


mm - E[/(X)]| < exp(O0p))(Tvar(/)/v^ = exp(0(p))(c 2 /Vp) n(p) . 


Note that by taking p to be a sufficiently large constant compared to c 2 , we can make the 
last bound arbitrary small. 

On the other hand, by Equation (6), 

m( x )\\ < exp(-Tvar(/)/2) < exp(-ci/2). 


Therefore, 

\mm\ < exp(-ci/2) +exp (0(p))(c 2 /y/p) n(p) < d 
for p sufficiently large constant and some constant 0 < d < 1. □ 

We reduce the general case of Tvar(/) £ [l,n] to the case above where Tvar(/) = 0(1) 
by using the Valiant-Vazirani technique of sub-sampling. For B C [n] let Tvar(/^) = 
YlieB a h K we sample a random subset B C [n] with \B\ ~ n/Tvar(/) in a pairwise 
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independent manner, we will get Tvar (f B ) = 0(1) with 11(1) probability. Since we do not 
know Tvar(/), we sample log(n) subsets whose cardinalities are geometrically increasing; 
one of them is likely to satisfy the desired bound. 

We set up some notation that will be used in the remainder of this section. 


• Assume n is a power of 2, and set T = log 2 (n) — 1. Let II C §„ be a family of 
pairwise independent permutations so that n G u II can be sampled efficiently with 
O(logn) random bits. For 0 < j < T, let Bj = (7r(i) : i € {2 J ,..., 2 J+1 — 1}} be the 
2 J co-ordinates that land in the j th bucket. 

• For v € M n , let v 3 = v Bj denote the projection of v onto coordinates in bucket j. 
Similarly, for x € [m] n , let x 3 denote the projection of x to the co-ordinates in Bj. 

• Fix an (m, n)-Fourier shape / : [m] n —> Ci with f(x) = Ylifi( x i)- Define f 3 : 
[m] Bj -> Ci as PixJ) = n isB j /»(*»)■ 

Lemma 5.4. Let v € M" with \\v\\\ € [1 ,n], Halloo < 1 and t € [log 2 n] be such that 
n/2 t+1 < ||uH 2 < n/2 t . Then, 


Pr 

7re u n 



€ [1/6,4/3] >7/16 


The proof of this lemma is standard and is deferred to Appendix C. 

This naturally suggests using an 0(l)-wise independent distribution within each bucket. 
But using independent strings across the log(n) buckets would require a seed of length 
0(\og{mn) ■ (logn)). We analyze our generator assuming independence across distinct 
buckets, but then recycle the seeds using PRGs for space bounded computation to keep the 
seed-length down to O(log(rrm)) (rather than 0(log 2 (re))). 

We now prove the main claim of this subsection. 

Lemma 5.5. There exists an explicit generator Q\ : {0, l} r —>• [m] n with r = 0(\og(mn)) 
such that for all Fourier shapes f : [m] n — >• Ci with Tvar (/) > 1, we have 


E 

z~{0,l} r 




< c. 


for some constant 0 < c < 1. 

Proof. Let ir II and let Zi ~ [m] 2J be an independent p-wise independent string for a 
parameter p = 0(1) to be chosen later. Define 

Q[ (tt, Z°,..., Z T ) = Y, where V Bj = ^ for j € {0,..., T}. 

In other words, the generator applies the string to the coordinates in bucket Bj. 
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Observe that f(Y ) = 1 fH^)- Since the Z- 7 ’s are independent of each other 


mf(Y)]\ 


log(n)-l 

n nf j (z j )} 


j =0 


< N/*(Z% 


for any t < T. Applying Lemma 5.3 to v = (ai(fi),..., cr n (fj)), we get that for some 
t <T, Tva r(f t ) = || G [1/6,4/3] with probability at least 7/16. Conditioned on this 
event, Lemma 5.3 implies that for p a sufficiently large constant, there exists a constant 
d < 1 so that |E[/ f (Z*)]| < d. Therefore, overall we get 

|E[/(y)]| < |e[/*(z*)]| < A += c " < l. 

We next improve the seed-length of G[ using the PRG for ROBPs of Theorem 3.4. To 
this end, note that by Lemma 3.2 we can assume that every log (fi(xi)), and hence every 
\og(f J has bit precision at most O(logn) bits (since our goal is to get error <5 = 0(1)). 
Further, each Z J can be generated efficiently with 0(log(mn)) random bits. 

Thus, for a fixed permutation n, the computation of f{G'{ 7r, Z 1 ,..., Z T )) can be done 
by a (S,D,T)- ROBP where S,T are O(logn) and D = 0(log(rnn)): for j G {1,...,T}, 
the ROBP computes f J (Z J ) and multiplies it to the product computed so far, which can 
be done using O(logn) bits of space. Let Q NZ : {0, l} r —> ({0,1} D ) T be the generator 
in Theorem 3.4 fooling (S, D, T)-ROBPs as above with error <5 < (1 — c")/ 2. Q NZ has 
seedlength 0(log(mn)). Let 

Qi(n,z) = G'i(tt,G NZ (z)). 


It follows that \¥,[f(Gi(n,z))]\ < c for some constant c < 1. Finally, the seed-length of 
G i is 0(\og{mn)) as ir can be sampled with O(logn) random bits and the seed-length of 
G nz is 0(log(mn)). The lemma is now proved. □ 


5.2 Reducing the error 

We now amplify the error to prove Theorem 5.1. The starting point for the construction 
is the observation that for X G u [ m] n , |E[/(A)]| < exp(— Tvar(//2)) < 5 once Tvar(/) 
log(l/5). Therefore, it suffices to design a generator so that E[/] <C 5 , when Tvar(/) is 
sufficiently large. 

Our generator will partition [n] into rn = 0((log(l/i5)) 5 ) buckets B i, ..., B m . using a 
family of hash functions with the following spreading property: 

Definition 5. A family of hash functions 7~L = {h : [n] —>• [m]} is said to be (B,£,5)- 
spreading if for all v G [0, l] n with \\v\\\ > B, 

, p r [\{j € [m] : \\v h -i^ ^ > B/2m}\ >£}>!-8. 
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Using the notation from the last subsection, we write f(x) = IT ; =i f 3 ( x3 ) where 
/'(•'•') = I fi(xi). If Tvar(/) is sufficiently large, then the spreading property guar¬ 
antees that for at least D(log(l/5)) of the buckets Bj, Tvar (/■?’) > 1. If we now generate 
X € [ m] n by setting Xbj to be an independent instantiation of the generator Q\ from 
Lemma 5.3, then we get E [f(X)] <C 5. As in the proof of Lemma 5.5, we keep the 
seed-length down to 0(log(n/<5)) by recycling the seeds for the buckets using a PRG for 
small-space machines. 

We start by showing that the desired hash functions can be generated from a small-bias 
family of hash functions. We show that it satisfies the conditions of the lemma by standard 
moment bounds. The proof is in Appendix C 

Lemma 5.6. For all constants C\, there exist constants 62,63 such that following holds. 
For all 6 > 0, there exists an explicit hash family TL = {h : [n\ —> [T]} ; where T = 
62 log 5 (1 /<?)) which is (63 log 5 (l/5), 61 log(l/5), 5)-spreadm<? and h € u TL can be sampled 
efficiently with 0(log(n/5)) bits. 

We are now ready to prove Theorem 5.1. 

Proof of Theorem 5.1. Let i = 61og(l/<5) for some constant to be chosen later and let TL = 
{h : [n} —>• [T]} be a [B,i, <5)-spreading family as in Lemma 5.6 above for B = 0(log 5 (l/<5)) 
and T = 0(log 5 (l/<5)). Let Qi : {0, l} ri —>• [m] n be the generator in Lemma 5.3. Define a 
new generator Q' t : TL x ({0, l} r ) T —>• [ m] n as: 

G'e(h, z 1 ,..., z T ) = X, where X h -i^ = Gi(z j ) for j € [T], 

Let / : [m] n Ci with Tvar(/) > max(2 T,B). For h € TL, let I = {j : Tvar fi > 1}. 
For any fixed h € TL, as the z v s are independent of each other, 

m 

\nnm = n \np< n \npmz j ))\\ < 

3 = 1 fe/ 

where c < 1 is the constant from Lemma 5.3. By the spreading property of TL, with 
probability at least 1 — 5, |/| > 6Tog(l/5). Therefore, for C sufficiently large, 

|E[/(X)]| < ( 5 + c cl °g( 1 /' 5 ) < 25. 

As in Lemma 5.5, we recycle the seeds for the various buckets using the PRGs for 
ROBPs. By Lemma 3.2, we may assume that / 3 has bit precision at most 0(log(n/5)) 
bits. Further note that 


3= 1 
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For a fixed hash function h E PL, this can be computed by a (S,D,T)- ROBP where 
S = 0(\og(n/S)) and D = 0(log(mn)), corresponding to the various possible seeds for Q\. 
Let Q inw ; {0, l} r —»• ({0,1} D ) T be a generator fooling (S,D,T )~ROBPs as in Theorem 
3.3 with error 5 and define 

Gt(h,z) = g' t {h,g INW (z)). 

The seed-length is dominated by the seed-length of Q INW ; which is 
0{\og{mn/5) logT) = 0(\og(mn/5) log log(l/<5)). 

It follows that \K[f(Q((h,z))]\ < 3<5, whereas for a truly random Y E u [ m] n , 

|E[/(T)]| < exp(— Tvar(/)/2) <5. 


The theorem now follows. □ 

6 Alphabet reduction for Fourier shapes 

In this section, we describe our alphabet-reduction procedure, which reduces the general 
problem of constructing an e-PRG for (m, n)-Fourier shapes where m could be much larger 
than n, to that of constructing an e/ log(m)-PRG for (n , n)-Fourier shapes. This reduction 
is composed of 0(log log m) steps where in each step we reduce fooling (m, n)-Fourier shapes 
to fooling ( y/rn , n)-Fourier shapes. Each of these steps in turn will cost 0(log(m/e)) random 
bits, so that the overall cost is O (log (m/e) ■ (log log rri )). Concretely, we show the following: 

Theorem 6.1. Let n, 5 > 0 and suppose that for some r' = r'(n, 5'), for all m! < n 4 there 
exists an explicit generator Q m > : {0, l} ri —>• [m'] n which 5'-fools (m 1 ,n)- Fourier shapes. 
For all m, there exists an explicit generator Q m : (0, l} r —>• [m] n which ( 5' + S)-fools 
(■ m , n)-Fourier shapes with seed-length r = r' + 0(log(m/<5) log log(m)). 

Proof. We prove the claim by showing that for m > n 4 , we can reduce ((J+^-fooling (m, n)- 
Fourier shapes to that of (f'-fooling (y/rn, n)-Fourier shapes with 0(\og(m/6)) additional 
random bits. The theorem follows by applying the claim loglog(m) until the alphabet size 
drops below ?r 4 when we can use Q m >. This costs a total of r' + 0(fog(in/5) log log(m)) 
random bits, and gives error 5' + loglog(m)5. The claim follows by replacing <5 with 
5/ log log(m). 

Thus, suppose that m > n 4 and for D = > we have a generator Qo ■ {0, l} rD —>• 

[. D] n which d'-fools (D, n)-Fourier shapes. The generator Q m works as follows: 

1. Generate a matrix X E [m] Dxn where 

• Each column of X is from a pairwise independent distribution over [m] D . 

• The different columns are fc-wise independent for k = CTog(l/(f)/log(m) for 
some sufficiently large constant C. 
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2. Generate Y = (Y\ ,... ,Y n ) = Gd(z) G [D] n for z G u {0, l} r °. 

3. Q rn outputs Z = (Z \,..., Z n ) G [m] n where Zj = X[lj, j] for j G [n]. 

Each column of X can be generated using a seed of length 2 log m. By using seeds for var¬ 
ious columns that are fc-wise independent, generating X requires seedlength 0(k\ogm ) = 
0(log(l/(5)) (as m > n 2 ), while the number of bits needed to generate Z is rD+0(log(l/<5)). 

Fix an (m,n)-Fourier shape / : [ m] n —>• Ci, f(z ) = n, fj(zj). For x G [m] Dxn , define 
a (D, n)-Fourier shape f x : [D] n —> Ci by: 


f x {yi,---,y n ) = 

i=l 


Note that f(Z) = f x (Y'). 

Let X', Y' be random variables distributed uniformly over [m] I)xn and [D] n respectively. 
Let Zj = X'fYj, j] for j G [n], so that Z' is uniform over [m] n and f(Z') = f x '(Y'). Our 
goal is to show that f(Z') and f(Z) are close in expectation. We do this by replacing X' 
and Y' by X and Y respectively. 

That we can replace Y' with Y follows from the pseudorandomness of Gd- For any 
fixed x G [m] n , as Gd fools (D, n)-Fourier shapes, 


E \f x (Y) 1- E \f x (Y')}\ < 5'. 
Y=g D (z) u y’& u [d\Y n] 


We now show that for truly random Y\ one can replace X by X'. Note that 


( 7 ) 


y^W r{¥ ')] = n (^ • (E/iWM))) - (8) 

where we define the bias-function Bf : [m\ Dxn —>• Ci as above. We claim that X fools Bf. 

\E[B f (X)]-E[B f (X')]\<6. (9) 

For j G [n], let 

A i = i (em x • A i = i (emxvjd'J 

so that 

n n 

B,(X) = J\A S , S / (X') = n' 4 f 

3 =1 3 =1 
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Since fj(X[£,j]) £ Ci for t £ [D ], it follows that Aj , A'- £ Ci. Since the fj(X[£,j])s are 
pairwise independent variables, 


E [Aj] = E [A'j], Var [Aj] = Var[A'-]. 


Note that 


n n n 

E lB f (X')} = E[J] A'\ = E[A$] = [] E[Aj], E [B f (X)} = E[Ai • • • A*]. 


2=1 

The random variables A±,.. 

Xar(Aj) 


j =i j =i 

., A n are k -wise independent. Further, we have 

t =1 


Therefore, by Lemma 4.1, 


( 10 ) 


E[j4i ■ ■ • A n ] — JJ E[Aj 


3 = 1 


< 




( 11 ) 


where the second to last inequality follows becase n < m 1//4 and D > y/m/2, and the last 
holds for k = Clog(l/e))/log(m) for a sufficiently big constant C. Equation 9 now follows 
from Equations (11) and (10). 

Finally, 


|E[/(Z)] -E[/(Z')]| = \nf X (Y)} -E[f x \Y')] 
= nf x (Y')]-E[f x '(Y')} 


+ 5 ’ 


= \E[B f (X)}-E[B f (X')}\+6' 
<5+ 5'. 


Equation (7) 

Equation (8) 
Equation (9) 


Hence the theorem is proved. 


□ 


7 Dimension reduction for low-variance Fourier shapes 

We next describe our dimension reduction step for low-variance Fourier shapes. We start 
with an (m, n)-Fourier shape where m < n 4 and Tvar(/) < log(n/5) c . We show how one 
can reduce the dimension to t = s/n, at a price of a blowup in the alphabet size m! which 
now becomes ( n/6) c for some (large) constant c. 
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Theorem 7.1. Let 5 > 0, n > 0 and t = [y^n]. There is a constant c and m! < ( n/6) c 
such that the following holds: if there exists an explicit PRG Q' : {0, l} r —>• [to 7 ]* with 
seed-length r' = r'(n,S') which 6'-fools (to/ ,t)-Fourier shapes, then there exists an explicit 
generator Q : {0,l} r —>• [m] n with seed-length r = r' + 0(log(n/<5)) which (5 + 5')-fools 
(m,n)-Fourier shapes f with m < n 4 and Tvar (/) < n 1 / 9 . 

We first set up some notation. Assume that we have fixed a hash function h : [n] —» [t]. 
For x € [m] n and j € [t], let a/ denote the projection of x onto co-ordinates in h~ l (j). For 
an (to, n)-Fourier shape / : [m] n —>• Ci with / = Il':Li fi, let 


= n 

i:h(i)=j 

t 

so that /(x) = f^{xP). 

3 = 1 


We start by constructing an easy to analyze generator Gi which hashes co-ordinates 
into buckets using fc-wise independence and then uses independent fc-wise independent 
strings within a bucket. Let 


k = c log jn/5) 
log(n) 


( 12 ) 


where C will is a sufficiently large constant. Let TL : {[n] —>• t} be a fc-wise independent 
family of hash functions. Let Go : {0, l} r ° —>• [m\ n be a fe-wise independent generator over 
[m] n . Define a new generator G\ : TL x ({0, l} r °) t — > [to]” as: 


Gi(h,zi, ...,z t ) = Z, where Z j = G 0 (zj ) V j € /]. (13) 


We argue that G/ fools (to, n)-Fourier shapes with small total variance as in the theo¬ 
rem. Our analysis proceeds as follows: 

• With high probability over h £ u TL, each of the f J, s has low variance except for a few 
heavy co-ordinates (roughly Tvar(/)/f after dropping k/2 heavy coordinates). 

• Within each bin we have k -wise independence, whereas the distributions across bins 
are independent. So even conditioned on the heavy co-ordinates in a bin, the re¬ 
maining distribution in the bin is A:/2-wise independent. Hence each f J is fooled by 
Lemma 4.1. 


However, the seed-length of G± is prohibitively large: since we use independent seeds across 
the various buckets, the resulting seed-length is 0{y/n log (n/S)). The crucial observation 
is that we can recycle the seeds for various buckets using a generator that fools (to', t)- 
Fourier shapes with m! = 2 r ° = poly(n/<5) and t = 0(y/n). Given such a generator 
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Q' : {0, l} r/ —>• [m'Y which 5-fools (mi, f)-Fourier shapes, our final generator for small- 
variance Fourier shapes is Q s : Li x {0, l} r —>• [m] n is defined as 

g(h,w) = G 1 (h,g'(w)). (i4) 

It is worth mentioning that even though the original Fourier shape / : [m] n —>• Ci has low 
total variance, the generator Q’ needs to fool all (m', t)-Fourier shapes, not just those with 
low variance. 

7.1 Analysis of the dimension-reduction step 

For a > 0, to be chosen later, let L = {j £ [n] : a 2 (fj) > a} denote the a-large indices 
and S = [n] \ L denote the small indices. We call a hash function h £ Li (a, (3)-good if the 
following two conditions hold for every bin h~ 3 (j) where j £ [i]: 

1. The bin does not have too many large indices: | h~ l (j) n L\ < k/2. 

2. The small indices in the bin have small total variance: 

E ct 2 c«^- 

£<£L:h{i)=j 

Using standard moment bounds for fc-wise independent hash functions one can show 
that h £ u 'H is (a, /3)-good with probability at least 1 — n for a = n~ n d) anc [ ft = 
n ~ : ^f 1 ). We defer the proof of the following Lemma to Appendix D. 

Lemma 7.2. Let Tvar(/) < n 1 ' 9 and let LI = {h : [n] —>• [t]} be a k-wise independent family 
of hash functions for t = @(y/n). Then h £ Li is (n~ 1 / 3 ,n~ 1 / 36 )-good with probability 
1 - 0(k) k / 2 n~ n ^. 

We next argue that if h £ Li is (a, /3)-good then, fc-wise independence is sufficient to 
fool / J for each j € [t]. 

Lemma 7.3. Let h £ Li be (a, f3)-good, and let j £ [i]. For Z’ ~ [m] n k-wise independent, 
and Z" £ u [m] n , 

| nf j (Z')} - nf j (Z")}\ < exp (0(k)) • 

Proof. Fix j £ [t]. By relabelling coordinates, let us assume that h~ l (j) = {1 
and L n h~ 1 (j) = {1,..., r}, where r < k/2. As Z' is fe-wise independent, (Z(,..., Z' r ) is 
uniformly distributed over [m] r . We couple Z' and Z" by taking Z[ = Z'f for i < r. Even 
after conditioning on these values, Z ' r+1 ,..., Z' n . are k/ 2-wise independent. 

Let Y t = fe(Z' e ) for l £ {r + 1,..., nj}. As h is ( a , /3)-good, 

n/t 

E < z 3 - 

i=r +1 
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Therefore, by Lemma 4.1, 


E 


n >• 

f=r +1 


n 


£=r +1 


<exp(0(fc)) 


(15) 


But since Z^ = Z^ for £ < r, we have 

r n 

nf j {z')) = j{nf{z' t m n y t \, 


e=i 


£=r +1 
r n j 

nf{z")] = \\nf(z' e )] n E\y t ], 


i=i 


l=r +1 


|e[/ j (z')] — e[/ j "(z ,, )]| = 


n E [/(z;)]E[ n n-unm)] n 


£=1 


e =i 


< 


n n m- n 

l=r +1 i=r +1 

< exp(0(fc)) -/3 n(fc) . 


Since \/(Z' e )\ < 1 
Equation (15) 


□ 

We use these lemmas to prove Theorem 7.1. 

Proof of Theorem 7.1. Let / : [m] n -> Ci be a Fourier shape with Tvar(/) < n 1//9 . Let 
G i be the generator in Equation (13) with parameters as above. We condition on h £ u PL 
being (n -1 / 3 , n _1 / 36 )-good; by Lemma 7.2 this only adds an additional 0(k) k / 2 n~^( k ^ to 
the error. We fix such a good hash function h. 

Recall that Gi(h,zi,...,zt) = Z where Z J = Go(zj) for j € [t]. Since the ZjS are 
independent, so are the Z J ’s. Hence, 


E[f(G 1 (h,z\...,z t ))} = HE [f\&)\. 

3= 1 
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By Lemma 7.3, for (n 1 / 3 ,n 1 / 36 )-good h, if Y [m]”, then 


[]E[f(^)]-[]E[f(F 
7= 1 7=i 

t—1 r t r+1 t 

n*!^)] n E if’( z ’)i -nE[/^yj)] n Ep^)] 


sE 

r=0 

t-1 


7=1 i=r+l 7=1 

< E l E [f r+ \Z r+1 )] - [f r+ \Y r+1 )] | 

r=0 

< exp(0(ife)) • 0{tn~ k/3e ). 


j=r +2 


Combining the above equations we get that for Y € u [m] n , 

z u ..., z t ))} - E[/(Y)]| < 0(k) k / 2 n ~+ exp (O(fc)) • 0(tn- fc / 36 ) < 5 (16) 


where the last inequality holds by taking C in Equation (12) to be a sufficiently large 
constant. 

We next derandomize the choice of the z J, s by using a PRG for appropriate Fourier 
shapes. Let tq be the seed-length of the generator Go obtained by setting k = C log (n/S )/(log n) 
as above, and let c be such that ro < clog(n/<5). Let 


m 


r 


2 r ° < 



C 


and identify \m'] with {0, l} r °. Given a hash function h € Ti, let us define f ] : [rn/] —>• Ci 
for j € [t] and / : [m'] n —>• Ci as 


t 

f j (zj) = f j (Go( Zj )), f{z) = n f j ( Zj ) 

i =1 


respectively. Observe that / is a Fourier shape, and 

t 

f(Gi(h, Zl ,...,zt)) = n PiGoizj)) = f(z). 

7=1 

By assumption, we have an explicit generator Q' : {0, l} r —» [m , ] t which Y-fools (m ; , t)- 
Fourier shapes. We claim that Q : ~H x {0, l} r —> [m] n defined as 

g s (h,w) = G 1 (h,g'(w)) 

( 5' + 5) fools small-variance (m, n)-Fourier shapes. 
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Since Q' fools (m 1 , f)-Fourier shapes, 


|E [f(G(h ,«;))] - E[/(Gi(h, zi,..., z t ))]| < <5'. 

By Equation (16), whenever Tvar(/) < log [n/8) c , 

\E[f(G 1 (h,z 1 ,...,z t ))\-E[f(Z')}\ <5. 

Combining these equations, 

\E[f(g(h,w))]-E[f(Z'))\<8' + 6. 

The seed-length required for Q s is 0(log(n/5)) for h and r' for w. 


□ 


8 Putting things together 

We put the pieces together and prove our main theorem, Theorem 1.1. We show the follow¬ 
ing lemma which allows simultaneous reduction in both the alphabet and the dimension, 
going from fooling (m, n)-Fourier shapes to fooling (n 2 , \y/n\ )-Fourier shapes. 

Lemma 8.1. Let 5 > 0, n > log c (l/5) for some sufficiently large constant C, and t = 
[y^n]. If there exists an explicit PRG Q" : {0, l} r —>• [m"] 1 with seed-length r" = r"(n,S) 
which 6-fools {m",t)-Fourier shapes for all m" < n 2 , then there exists an explicit generator 
Q : {0, l} r —>• [ m] n with seed-length r = r" + 0(log(mn/5) log log(mn)) which 45-fools 
(■ m,n)-Fourier shapes . 3 

Proof. Let r" be the seed-length required for Q" to have error 5. Let m! < ( n/5) c be as in 
the statement of 7.1. Applying Theorem 6.1 to Q", we get a generator Q' with seedlength 
r"-\-0(fog{n/5) log log(n/<5)) that 5' = 25-fools (m r , y/n) Fourier shapes. Invoking Theorem 
7.1 with Q' , we get an explicit generator Q s : {0, l} rs —>• [m] n which 35 fools (m, n)-Fourier 
shapes / : [m\ n —>• Ci with Tvar(/) < n 1//9 and m < n 4 , with seed-length 

r s = r" + 0(log(n/5) log log(n/5)). 

For m < n 4 , let Ge ■ {0, l} re —>• [m] n be a generator for large Fourier shapes as in The¬ 
orem 5.1, which 5-fools (m,n)-Fourier shapes / : [m] n —>• Ci with Tvar(/) > C'log J (l/5). 
Since m < n 4 , this generator requires seed-length 

ri = O(log{n/6) log log(l/5)). 

3 Comparing this to Theorem 7.1, the main difference is that we do not assume that Tvar(/) is small. 

Further, the generator Q" for small dimensions requires m" < n 2 , and our goal is to fool Fourier shapes in 
n dimensions with arbitrary alphabet size m. 
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Define the generator 


Gi®s{wi,w 2 ) = Ge(we) © G s {w s ) 

where the seeds W£ € {0, l} r£ and w s € {0, l} rs are chosen independently and © is inter¬ 
preted as the sum mod m. Note that the total seed-length is 

r e + r s = r" + 0(log(n/<5) log log(n/<5)). 

We now analyze Gi® s - Let Y = Gi{w\) and Z = Gi(wi) and let X £ u [m] n . Fix an 
(m, n)-Fourier shape / : [m] n —>• Ci- We consider two cases based on Tvar(/): 

Case 1: Tvar(/) > Clog(l/<5) 5 . For any 2 € [m] n , define a new Fourier shape f z (y) = 
f(y © z). Then, for any fixed z, Y h-fools f z as Tvar(/~) = Tvar(/) > C\og(\/5) b . There¬ 
fore, 

IE[/(y © Z)\ - E[f(X)}\ < E |E lfz(Y)} - E[/(X)]| < 5. 

Z 

Case 2: Tvar(/) < n 1//9 . Consider a fixing y of Y and define f y (Z) = f(y © Z). Then, 
for any fixed y, Z 3h-fools f y as Tvar (f y ) < n 1 / 9 . Therefore, 

\E[f(Y(BZ)\-E[f(X)}\ <E|E[/ r (Z)]-E[/(X)]| <35. 

In either case, we have 

\E[f(Y®Z)\-E[f(X)}\<38. 

Finally, for arbitrary m, by applying Theorem 6.1 to Ge^ s , we get a generator G ■ 
{0, l} r —>• [m] n that 4 5 fools (m, n)-Fourier shapes with seed-length 

0(log(m/S) log log(m)) + m + r s = r" + 0(\og(nm/5) log log(nm/<5)). 

□ 


We prove Theorem 1.1 by repeated applications of this lemma. 


Proof of Theorem 1.1. Assume that the final error desired is S'. Let 5 = hY41og log(n). 

Applying Lemma 8.1, by using 0(log(mn/5') log log (mn/5')) random bits we reduce fooling 
(m, n)-Fourier shapes to fooling ( m\y/n] )-Fourier shapes for m' < rtf. 

We now apply the lemma 0(log log n) times to reduce to the case of fooling (log c '(l/5), log'Yl/d))- 
Fourier shapes. This can be done by noting that by Lemma 3.2 it suffices to fool Fourier 
shapes with log(/j) having 0(log(l/<5)) bits of precision. Such Fourier shapes can be com¬ 
puted by width-0(log(l/<5)) ROBPs, and thus using the generator from Theorem 3.3, we 
can fool this case with seed length 0(log(l/<5) log log(l/<5)) bits. Since each step requires 
0(log(n/h) log log(n/d) random bits, the overall seedlength is bounded by 

0(\og{mn/5) log log {mn/5) + 0(log(n/<5)(log log(n/<5)) 2 ). 


□ 


32 


9 Applications of PRGs for Fourier shapes 

In this Section, we show how Theorem 1.1 implies near optimal PRGs for halfspaces, modu¬ 
lar tests and combinatorial shapes. We first prove two technical lemmas relating closeness 
between Fourier transforms of integer valued random variables to closeness under other 
metrics. We define the Fourier distance, statistical distance and Kolmogorov distance 
between two integer-valued random variables respectively as 

Z 2 ) = max |E[exp(27T*aZi)l — E[exp(27r*aZ 2 )l|, (17) 

«e[o,i] 

d TV (Z 1 ,Z 2 ) = I Pr ( Z i = j) - Pl '( Z 2 = j) I, (18) 

1 jez 

dx(Zi,Z 2 ) = max(| Pr(Zi < k) — Pr(Z 2 < k)\) (19) 

The first standard claim relates closeness in statistical distance and Fourier distance 
for bounded integer valued random variables. 

Lemma 9.1. Let Z\,Z 2 be two integer-valued, random variables supported on [0, N]. Then, 

dxv{Zi, Z 2 ) < 0(VN) ■ dpT{Z\, Z 2 ). 

Proof. Note that the distribution Z\ — Z 2 is supported on at most 41V+1 points. Therefore, 

d TV {z x ,z 2 ) = II z x - z 2 Hi < vWTTlIZi - z 2 || 2 . 

On the other hand, the Plancherel identity implies that 

||Zi — Z 2 || 2 < dFr(Zi, Z 2 ). 


This completes the proof. □ 

The second claim relates closeness in Kolmogorov distance to closeness in Fourier dis¬ 
tance. The key is that unlike in Lemma 9.1, the dependence on N is logarithmic. This 
difference is crucial to fooling halfspaces with polynomially small error (since there N can 
exponential in the dimension n). 

Lemma 9.2. Let Z\,Z 2 be two integer-valued random variables supported on [— N, N], 
Then, 

d K (Zi,Z 2 ) < 0(log(JV) • d FT {Zi,Z 2 )). 

Proof. By definition we have that 

dx(Zi,Z 2 ) = max (| Pr(Zi < k ) — Pr(Z 2 < k)\). 
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We note that 


Pr {Zi <k)= J 2 P Ai = j) 

j=-N 
^ ^ Pi 

= 2 _, / exp(— 2 m j a) K[exp(27riaZi)\da 
j=—N ® 

= / s(/c, IV, a) E[exp(27riaZ,,)]da 

Jo 

where 

k 

s(k,N,a)= ^2 ex.p(—2irija). 

j=—N 

It is clear that |s(fc,IV,a)| < 21V. Further, 


\s(k, N,a)\ = 


exp(—27rifca)(exp(27ri(A r + A: + l)a) — 1) 


- 7 —“—\- 7 < 0 (-^ 

exp(27r*a) — 1| [aj 


exp(27r/ai) — 1 

where [a] is the distance between a and the nearest integer. Therefore, we have 
I Pr(Zi < k) — Pr (^2 < k)\ < f |s(&, N, a)| |E[exp(27riaZi)] — E[exp(27riaZi)]| da 

Jo 


min ( N, -^r ] ) dp-r{Z\. Z 2 )da 


<-!'A 

( rl/N r 1/2 da rl—X/N d rl 

= 0(d FT (Zi,Z 2 ))[ / Nda + / — + / --+ / 

\J o Ji/n a J 1/2 l —a J i_ 


= 0(d FT (Z 1 ,Z 2 )log(N)). 


□ 


9.1 Corollaries of the main result 

We combine Lemma 9.2 with Theorem 1.1 to derive Corollary 1.2, which gives PRGs for 
halfspaces with polynomially small error from PRGs for (2, n)-Fourier shapes. 

Proof of Corollary 1.2. Let Q : {0, l} r —»■ {±l} n be a PRG which d-fools (2, n)-Fourier 
shapes (here we identify [2] with {±1} arbitrarily). We claim that Q also fools all halfspaces 
with error at most e = 0(nlog(n)5). 

Let h : {±l} n —>• {±1} be a halfspace given by h{x) = 1 + ((w,x) — 0). It is well known 
that we can assume the weights and the threshold 6 to be integers bounded in the range 


Nda 
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[~N,N] for N = 2 °( nlogri ) (cf. [LC67]). Let X € u {±l} n and Y = Q{y) for y £ u {0, l} r 
and Z\ = ( w , X), Z 2 = (w, Y). Note that Z \, Z 2 are bounded in the range [-n-N,n-N]. 
We first claim that 

(Lft{Z\, Z 2 ) < S. 

For a € [0,1], we define f a : {±l} n —>• Ci as 


n 

fa (x) = ex.p(2iria(w, x)) = exp(27r iawjXj) (20) 

3 = 1 


then f Q is a (2, n)-Fourier shape. Hence, 

\E[f a (X)]-E[f a (Y)]\<6. 

That cIft(Z 1 , Z 2 ) < 5 now follows from the dehnition of Fourier distance, and the fact that 
E [f a (X)] and E [f a (Y)] are the Fourier transforms of X and Y at a respectively. 

Therefore, by Lemma 9.2 applied to Z\, Z 2 , dK{Z\,Z 2 ) < 0{n\ogn)5. Finally, note 
that 

|E[h(X)] -E[h(y)]| < d K ((w,X),(w,Y)) = d K (Z 1 ,Z 2 ) < 0(nlog n)5. 

The corollary now follows by picking a generator as in Theorem 1.1 for m = 2 with 
error 5 = e/(Cn log n) for sufficiently big C. □ 

To prove Corollary 1.3, we need the following lemma about generalized halfspaces. 

Lemma 9.3. In Definition 3, we may assume that each g t (j ) is an integer of absolute 
value 

Proof. Let g : [m] n —» {0,1} be a generalized halfspace where the < 7 ,;S are arbitrary. Embed 
[m\ n into {0, l} mn by sending each Xi € [m] to (yi t i, ..., yi >m ) where y t .j = 1 if X{ = j and 
y t j = 0 otherwise. Note that 

n n m 

9i{j)Vi,j 

i= 1 i=l j=l 

However, the halfspace 

n m 

EE 9i(j)yi,j ^ 6 

i =1 j= 1 

over the domain {0, l} m?l has a representation where the weights g'fij) and 6' are integers 
of size at most Hence we can replace each gi(j) in the defintion of g with g'fij) 

without changing its value at any point in [m] n . □ 

We now prove Corollary 1.3 giving PRGs for generalized halfspaces over [m] n . 
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Proof of Corollary 1.3. Letting X G u [m] n and letting X' be obtained from a PRG for 
(m, n)-Fourier shapes with error at most e, we let Z\ = 9% (W,- ) and Z 2 = Yli9i(X'i)- 

By Lemma 9.2 that dx(Z\, Z 2 ) < 0(snm log (nm)). Picking e sufficiently small gives our 
generator for generalized halfspaces. □ 

Next we use Corollary 1.3 to get PRGs fooling halfspaces under general product dis¬ 
tributions. From the definition of generalized halfspaces, it follows that if V is a discrete 
product distribution on M n where each co-ordinate can be sampled using log(m) bits, then 
fooling halfspaces under T> reduces to fooling generalized halfspaces over [m] n for some 
suitable choice of g % . In fact [GOWZIO] showed that fooling such distributions is in fact 
sufficient to sufficient to fool continuous product distributions with bounded moments. The 
following is a restatement of [GOWZIO, Lemma 6.1]. 

Lemma 9.4. Let X be a product distribution on M” such that for all i € [n\, 

E[Xi] = 0,E[X 2 ] = l,E[Xf] < C. 

Then there exists a discrete product distribution Y such that for every halfspace h, 

\E[h(X)} -E[/i(y)]| < e. 

Further, each Yj can be sampled using log(n, 1/e, C) random bits. 

Note that the first and second moment conditions on X can be obtained for any prod¬ 
uct distribution by an affine transformation. Hence we get Corollary 1.4 from combining 
Lemma 9.4 with Corollary 1.3. In particular, there exist generators that fool all halfspaces 
with error e under the Gaussian distribution with seed-length r = 0(log(n/e)(log log(ri/e:)) 2 ) 
This nearly matches the recent result of [KM15] upto a log log factor. Further, it is known 
(see e.g [GOWZIO, Lemma 11.1]) that PRGs for halfspaces under the Gaussian distribution 
imply PRGs for halfspaces over the sphere. 

We next prove Corollary 1.5 which derandomizes the Chernoff bound. 

Proof of Corollary 1.5. First note that we can assume without loss of generality that each 
X t can be sampled with r x = 0(log(mn/e)) bits (by ignoring elements which happen 
with smaller probability). In particular, let each X t have the same distribution as hi{Z) 
for Z G u \m'\ where m' = 2 Tx (here we identify [rn 1 ] with {0,1} 7-1 ) and some function 
hi : [m!] —>• [m\. Let Q : {0, l} r —>• [ m'] n be a PRG which (e/2)-fools im', n)-generalized 
halfspaces. Now, let Y = (h\(Zi), ^ 2 (^ 2 ), • • •, h n (Z n )), where (Zi,..., Z n ) = Q(w ) for 

i»e.{o,i} r . 

Note that Y can be sampled with 0(log(mn/e) • (log log 2 (mn/e))) random bits. We 
claim that Y satisfies the required guarantees. To see this, define the generalized halfspaces 

, g-(z) = t + ( ^2~gi(hi(zi)) + 6 

\i =1 



g + (z) = V 


f n 

E 


9i(hi(zi )) - 
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where 


e = t+j2 E i9i(Xi)] = t+j2n9i(yi)}- 

i i— 1 

From Corollary 1.3 it follows that 

\E[g + (X)} -E[ 5+ (Y)]| < e/2, | E\g.(X)] - E[s_(Y)]| < e/2. 

From the Chernoff-Hoeffding bound [Hoe63], we have 

< 2 e~ t2/2n . 


ng + (X)+g_(X)]=Pr 
Hence by the triangle inequality, 


n9i( Xi )\ 


2=1 


2=1 


> t 


Pr 




2=1 


2=1 


> t 


= E [g + (Y) + g_(Y)] < 2e~ t2 ' 2n + e. 


□ 


We next prove Corollary 1.6 about fooling modular tests. 

Proof of Corollary 1.6. Let Q : {0, l} r —> {0, \} n be a PRG which fools (2, n)-Fourier shapes 
with error e/y/Mn. We claim that Q fools modular tests with error at most e. 

Let g(x) = l(£h aiXi mod M € S) be a modular test, let X E u {0, l} n and Y = G(y) 
for y G u {0, l} r . In order to fools modular tests, it suffices that 

drv(^2 aiXi ’ aiYi ^> - £ ' 

2 2 

On the other hand, since both these random variables are bounded in the range {0, Mn}, 
by Lemma 9.1 

drv aiXi ' X] aiY ^j - ■ dp T aiXi, e 

where the last inequality uses the fact that the Fourier transforms of both random variables 
are (2, n)-Fourier shapes by Equation (20). □ 

Next we prove Corollary 1.7 giving PRGs from combinatorial shapes. 
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Proof of Corollary 1.7. Recall that a combinatorial shape / : [m] n —>• {0,1} is a function 


f(x) = h 

where fji : [m] —>• {0,1} and h : { 0,..., n} —>• {0,1}. Since ]>T 9 i( x i ) E {0,..., n}, it suffices 
to fool the generalized halfspaces 


f( x ) = gi ( Xi ) “ 0 

i 

for 0 £ {0,... ,n} each with error e/n. Hence the claim follows from Corollary 1.3 about 
fooling generalized halfspaces. □ 
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A Proofs from Section 3 


Proof of Lemma 3.5. Let Ij t, be the indicator function of the event that h(i) = k. Note 
that h{y) = Y Jl . h k I hk I ]P v ‘t v ]- Therefore, 

h(v) p = y y 


Let R(it,jt , kt ) be 0 if for some t, t' kt / but one of b or jt equals or j t > and otherwise 
be equal to m~ T where T is the number of distinct values taken by it or ft- Notice that 
by the <5-biasedness of h that 


E 


p 

I~J 

_t =l 


< R(i t ,jt,k t ) + 5. 


Combining with the above we find that 


E[h{v) p ]< Y Y ( R (k,jt,h) + 5) 

^lv^piiiviip k\,...,kp t= 1 

< Y Y R(H-Juh)\[vlvl+8m p Y, U v l v l 

ki,...,kp t= 1 ii,...,ip,ji,...J p t=l 

< Y Y R ^t,jt,h)W_vf t v 2 jt +5m p \\v\\z p . 

il,...,ip,jl,...,jp k\,...,kp t =1 


Next we consider 

Y R (k,jt,k t ) 

for fixed values of ii, ■ ■ ■ ,i p ,ji, ■ ■ ■ ,j p - We claim that it is at most m~ s / 2 where S is 
again the number of distinct elements of the form it or jt that appear in this way an odd 
number of times. Letting T be the number of distinct elements of the form it or jt, the 
expression in question is m~ T times the number of choices of kt so that each value of it 
or jt appears with only one value of kt- In other words this is m~ T times the number of 
functions / : {it,jt} —>• [m] so that /(it) = f(jt) for all t. This last relation splits { it,jt } 
into equivalence classes given by the transitive closure of the operation that x ~ y if x = it 
and y = jt for some t. We note that any x that appears an odd number of times as an 
it or jt must be in an equivalence class of size at least 2 because it must appear at least 
once with some other element. Therefore, the number of equivalence classes, E is at least 
T — S/2. Thus, the sum in question is at most m~ T m E < m~ s / 2 . Therefore, we have that 


E[h(u) p ] < (2 p)\ Y m -(Odd(M)}/2 j-j v 2 + 6m P || v ||4p. 

Multisets M<z[n\,\M\=2 P i£M 
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Where Odd(M) is the number of elements occurring in M an odd number of times. This 
equals 


p 

E[h(v) p ] < (2 p)\ ^2 Y1 m ~ k II v i ^ mP IMl 2 P 

k=0 Multisets Mc[n],\M\=2p,Odd(M)=2k zGM 

< o) ! j2 n,rk y y n^n^+^N? 

k=o iiviip-fc 

= (2p)! y; \\v\\f p ~ k ' > + 6m p \\v\\^ p 

k =o V m / 

< 0 (p) 2p ^ + 0 (p) 2p \\v\\ Ap + 5m p ||u||2 P . 

Note that the second line above comes from taking M to be the multiset 


{fli *2i • • ■ )*2fc> hi jh 32, hi ■ ■ • ) jp-kijp-k}- 
This completes our proof. 


□ 


Proof of Lemma 3.6. Let X, denote the indicator random variable which is 1 if h(i) = j 
and 0 otherwise. Let Z = ]TL v l X l . Now, if h were a truly random hash function, then, by 
Hoeffding’s inequality, 

Pr [|Z — Hul^ /m\ > t] < 2 exp ( —t 2 / 2 ^ v 2 J . 


Therefore, for a truly random hash function and even integer p > 2, \\Z\\ p = 0(||u|| 2 )-y/p. 
Therefore, for a 5-biased hash family, we get \\Z\\ P < 0(p) p / 2 ||u ||2 + ||u|| p <5. Hence, by 
Markov’s inequality, for any t > 0, 


Pr [|Z — 11^1^ jm\ > t] < 


0 (p) p / 2 ||u||2 + IHli 5 


tp 


□ 


B Proofs from Section 4 


Proof of Lemma f.2. First we note that since for any complex random variable, Z, that 


E 


\Z\* 


2°WE[\$t(Z)\ k + \Q(Z)\ k ] 
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and Var (Z) = Var (ffl(Z)) + Var(9(Z)), it suffices to prove our lemma when Z is a real¬ 
valued random variable. 

We can now compute the expectation of (JT Zi) k by expanding out the polynomial in 
question and computing the expectation of each term individually. In particular, we have 
that 


E 


E z < 


E E 


^ 1 i“-^k 


k 

n z , 

3=1 


Next we group the terms above by the set S of indices that occur as ij for some j. Thus, 
we get 


EE E E 

m= 1 \S\=mii,...,ik£S 
{ij}=S 


k 

3=1 


We note that the expectation in question is 0 unless for each j € S, Zj occurs at least 
twice in the product. Therefore, the expectation is 0 unless m < k/2 and overall is at most 
B k ~ 2m ri/eS Var(Zj). Thus, the expectation in question is at most 

k/2 

E E mk B k ~ 2rn I] Var(Zj). 

m =1 |5|=m jeS 

Next, note that by expanding out (JT Var(Zj)) m we find that cr 2m > ml X)|S|=m rijes Var(Zj). 
Therefore, the expectation in question is at most 

k/2 k/2 


'y ^ < 20 (k) yj^k—mj^k—2m^.2m ^ ^(k) ^ j,k—mj^k—2m^'. 


2m 


rn 1 


m =0 


< 2°W [k k/2 a k + k k B k ^j 

< 2 0( - k \aVk + Bk) k , 


as desired. 


□ 


C Proofs from Section 5 

Proof of Lemma 5.f. First note that t e {0,... , T} satisfying the hypothesis exists since 
|ju|j 9 € [1, n]. For t € [n], let I(£) be the indicator random variable which is 1 if i € Bf. 
Since \B t \ = 2*, Pr[/(£) = 1] = 2 t /n. If we set V = ||ut|| 2 , 

v = Y J vim 










By the pairwise independence of a, 


e \v 2 } = Y J vtm+ y 

i+i'= i 


i=i 


n nt n o2i 

e= i t±t '=i 

< - |Mlt + E[E] 2 . 

n 


Therefore, 

Var(C) = E{V 2 ] - E[V] 2 < * |M | 4 < LM IMlL < ^ 
Thus, by Chebyshev’s inequality, 


Pr[|E-E[E]| > 1/3] < 9/16 

In particular, with probability at least 7/16, V = ||u t || 2 € [1/6,4/3]. □ 

Proof of Lemma 5.6. Let i = 2C\ log(l/<5) and T = 0(log 5 (l/<5)) to be chosen later. Let 
Pi = {h : [n] —>• [T]} to be a d'-biased family for 5' = exp(—C(log(l/<5))) for C a sufficiently 
large constant. 

Let p = clog(l/<5)/log log(l/<5)) for a constant c to be chosen later. Let v € [0, l] n 
with ||u || 2 > C* 2 log 0 (1/(5) and note that if — |M| 2 /^ f° r some 3 € [T], then 

h(y) > ||u || 2 /^ 2 (recall the definition of h(v) from Equation (2)). Therefore, by Lemma 3.5 
and Markov’s inequality, the probability that this happens is at most 


E [h{v)P]£ 2 P ( £ 2 p 


,4 \P 


. < 

14 p — 


14 p 


0 (p) 


2 P 


< o 


P 2 p 


•2 \P 


+ o 


'pH 2 V 

.Ml) 


+ 0{p) 2 P\\v\\f + TP \\v\\f 5' 


+ T p £ 2p 8' 


<0(log(l/5))-P + 0(log(l/5)) 7 P8' 


for a suitable choice of the constant c and 5' = exp(—CTog(l/(5)). 

Now suppose that \\vh~ 1 (j)\\l < \MlH f or all j € [T\. Let I = {j : ||u ft -i(j )|| 2 > 
IHI 2 / 2 T}. Then, 

\\v\\l<\I\-(\\v\\l/t) + T\\v\\l/{2T). 

Therefore, we must have |/| > t/2. This proves the claim. □ 
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D Proofs from Section 7 


Proof. Let a = n 1//3 , /3 = n 1/1,36 . 

Note that \L\ < Tvar {f)/a < n 2//9 . Since h E u Pi is k-wise independent, for any index 
3 e Mi 

Pr[|Ln/i _ 1 (j)| > m < ($) (i )‘ /2 < (^)‘ /2 i }) 1 ' 2 £ O (n-/‘f /2 . 

( 21 ) 


Define v € M n by Vj = <J 2 (fj ) if j E S' and 0 otherwise. Now, 


W o = 


Y a 4 (fi) ^ ma .F 2 (/i) ^V.?) - Tvar (/) a - 

ies Ji ' S jes 


By Lemma 3.6 applied to v, we get that for any j E [i], 


Pr 


^ j 2 (a) ^ Tvar(/) | (Tvar(/)) 1 / 2 q 1 / 4 

£<ES:h{l)=j t 


< 0(k) k / 2 a k/4 = 0 (k) k/2 n~^ k) . 


( 22 ) 


This completes the proof. 


□ 
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